Enforcing Monotonous Shape Growth or Shrinkage in Video Segmentation
نویسندگان
چکیده
One of the great challenges in computer vision is automatic segmentation of objects in videos. This task becomes more difficult when image sequences are subject to low signal-to-noise ratio or low contrast between intensities of neighboring structures. Such challenging data are acquired routinely, for example, in medical imaging or satellite remote sensing. While individual frames can be analyzed independently, temporal coherence in image sequences provides a lot of information not available for a single image. In this work, we focus on segmenting shapes which only grow or shrink in time, from sequences of extremely noisy images. We consider image sequences, where both foreground and background intensity distributions can vary significantly over time, foreground can be heavily occluded or undistinguishable from a part of the background. Most of previously-proposed spatio-temporal methods rely on coherence of foreground/background intensity distributions in consecutive image frames, and therefore fail when segmenting such noisy data sets. Few approaches have been designed for spatio-temporal segmentation of shapes from magnetic resonance (MR) images with low signalto-noise ratio [3, 4]. Applied to multi-temporal sequences that show a monotonously growing or shrinking structure, these smoothing methods bias results towards the mean shape obtained from averaging consecutive segmentations and, hence, underestimate rapid growth or shrinkage events. To address this issue, we propose a new segmentation framework based on graph cuts for the joint segmentation of an image sequence. It introduces growth or shrinkage constraint in graph cuts by using directed infinite links, connecting pixels at the same spatial locations in successive image frames. By minimizing an energy computed on the resulting spatio-temporal graph of the image set, the proposed method yields a globally optimal solution. Differently from the state-of-the-art spatiotemporal techniques, it does not rely on the coherence of the intensity in time, but only on the coherence of the shape. Graph cut is an optimization tool, which can be used to find the globally optimal binary segmentation of images [1], where the segmentation criterion E is related to a Markov Random Field with submodular interaction terms:
منابع مشابه
SIDF: A Novel Framework for Accurate Surgical Instrument Detection in Laparoscopic Video Frames
Background and Objectives: Identification of surgical instruments in laparoscopic video images has several biomedical applications. While several methods have been proposed for accurate detection of surgical instruments, the accuracy of these methods is still challenged high complexity of the laparoscopic video images. This paper introduces a Surgical Instrument Detection Framework (SIDF) for a...
متن کاملIntegrated Video Object Segmentation and Shape Coding
To enable content-based access and manipulation of video content, modern multimedia communications require per-object based access to video data. Two additional building blocks of video transmission systems needs to be defined, which enable access to video objects at the receiver end. These are video segmentation, which decomposes video frame into a set of layers, where each layer includes segm...
متن کاملA Study of Actor and Action Semantic retention in Video Supervoxel Segmentation
Existing methods in the semantic computer vision community seem unable to deal with the explosion and richness of modern, open-source and social video content. Although sophisticated methods such as object detection or bag-of-words models have been well studied, they typically operate on low level features and ultimately suffer from either scalability issues or a lack of semantic meaning. On th...
متن کاملرویکردهای متفاوت به هزینه مبادله و دلالتهای آن برای محیط کسب و کار و اقتصاد ایران
This paper tries to indicate that new institutional economy has various origins and does not have an integrate wholeness. Lack of integraty in new institutional economy is reflected in its approach toward transaction costs there are two approaches toward transaction costs, one of which focuses on the firm and its relations and the other emphasizes on macro-infrastructures. Considering North’s a...
متن کاملSegmentation According to Natural Examples: Learning Static Segmentation from Motion Segmentation (pre-print)
The Segmentation According to Natural Examples (SANE) algorithm learns to segment objects in static images from video training data. SANE uses background subtraction to find the segmentation of moving objects in videos. This provides object segmentation information for each video frame. The collection of frames and segmentations forms a training set that SANE uses to learn the image and shape p...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013